Why does the Standard GARCH(1,1) model work well? 
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The AutoRegressive Conditional Heteroskedasticity (ARCH) and its generalized version (GARCH) family of 
models have grown to encompass a wide range of specifications, each of them is designed to enhance the ability 
of the model to capture the characteristics of stochastic data, such as financial time series. The existing literature 
provides little guidance on how to select optimal parameters, which are critical in efficiency of the model, among 
the infinite range of available parameters. We introduce a new criterion to find suitable parameters in GARCH 
models by using Markov length, which is the minimum time interval over which the data can be considered as 
constituting a Markov process. This criterion is applied to various time series and results support the known 
idea that GARCH(1,1) model works well. 
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INTRODUCTION 

The ARCH model H and standard GARCH model H are 
now not only widely used in the Foreign Exchange (FX) liter- 
ature J2I] but also as the basic framework for empirical stud- 
ies of the market micro-structure such as the impact of news 
J3] and government interventions JH @], or inter- and intra- 
market relationships 17, 8]. Due to basic similarities between 
the mentioned models of different constants, here we focus 
our discussion on financial context. 

The main assumption behind this class of models is the rel- 
ative homogeneity of the price discovery process among mar- 
ket participants at the origin of the volatility process. Volatil- 
ity is an essential ingredient for many applied issues in fi- 
nance. It is becoming more important to have a good mea- 
sure and forecast of short-term volatility, mainly at the one 
to ten day horizon. Currently, the main approaches to com- 
pute volatilities are by historical indicators computed from 
daily squared or absolute valued returns, by econometric mod- 
els such as GARCH. In other words, the conditional density 
of one GARCH process can adequately capture the informa- 
tion. In particular, GARCH parameters for the weekly fre- 
quency theoretically derived from daily empirical estimates 
are usually within the confidence interval of weekly empirical 
estimates |9J]. It is interesting to note that despite the exten- 
sive list of models which now belong to the ARCH family 
ifiol [Til 121. the existing literature provides little guidance 
on how to select optimal q and p values in a GARCH (q,p) 
model. The two parameters q and p in GARCH model refer 
to history of the return and volatility of time series, respec- 
tively. However, there are some criteria to find the suitable q 
and p. Pagan and Sabu 11311 . suggest a misspecified volatility 
equation that can result in inconsistent maximum likelihood 
estimates of the conditional mean parameters I1 1411 . Further 
West and Cho [ 15] show how appropriate GARCH model se- 
lection can be used to enhance the accuracy of exchange rate 



volatility forecasts. However, there is an almost ubiquitous 
reliance on the standard GARCH (l,l)-type model in the ap- 
plied literature. In this paper, we suggest a new approach to 
find optimal values q and p. In our approach, a fundamental 
time scale is the Markov time scale, tnt> which is the mini- 
mum time interval over which the data can be considered as 
constituting a Markov process. As our starting point, we pro- 
ceed by stressing on the role of q and p in the GARCH model 
and their similarity to the Markov length scale in stochastic 
processes. There are reported examples of measuring Markov 
length for various data 1161 1 1711 . Recently Tabar et al., Il 811 . 
have used the dynamics of the Markov length for predicting 
earthquakes. Moreover, Renner et al. have shown that 
Markov time scale in returns of the high frequency (minutely) 
US Dollar - German Mark exchange rates in the one-year pe- 
riod October 92 to September 93 is larger than 4 minutes. 

Generally, we check whether the data follow a Markov 
chain and, if so, measure the Markov length scale, tjv/- Such 
a given process with a degree of randomness or stochasticity 
may have a finite or an infinite Markov length scale. Specifi- 
cally, the Markov length scale is the minimum length interval 
over which the data can be considered as a Markov process. 
The main goal of this paper is to utilize the mentioned simi- 
larity between the role of q and p in the GARCH model and 
Markov time scale in stochastic processes, and introduce a 
novel method to estimate optimal GARCH model parameters 
of daily data. 

GARCH BASED VOLATILITY MODELS 

The GARCH model, which stand for Generalized AutoRe- 
gressive Conditional Heteroscedasticity, is designed to pro- 
vide a volatility measure like a standard deviation that can 
be used in financial decisions concerning risk analysis, port- 
folio selection and derivative pricing. The GARCH models 
have become important tools in the analysis of time series 
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data, particularly in financial applications. This model is espe- 
cially useful when the goal of the study is analyze and forecast 
volatility. 

Let the dependent variable be labeled by r t , which could be 
the return on an asset or portfolio. The mean value fi and the 
variance a 2 will be defined relative to past information set. 
Then, the return r in the present will be equal to the mean 
value of r (that is ,the expected value of r based on past infor- 
mation) plus the standard deviation of r times error term for 
the present period. 

The challenge is to specify how the information is used to 
forecast the mean and variance of the return, conditional on 
the past information. The primary descriptive tool was the 
rolling standard deviation. This is the standard deviation cal- 
culated using a fixed number of the most recent observations. 
It assumes that the variance of tomorrow's return is an equally 
weighted average of the squared residual from the past days. 
The assumption of equal weights seems unattractive as one 
would think that the more recent events would be more rele- 
vant and therefore should have higher weights. Furthermore 
the assumption of zero weights for observations more than 
one period old, is also unattractive. The GARCH model for 
variance looks like this Q2Q: 



r t = (i + a t e t 

a\ = a + airt_i + + a p r 2 _ p 

+ + /Vt% (i) 

where r t denotes the returns and e t ~ iid N(0, 1). The con- 
stants ao up to a p and j3x up to j3 q must be estimated. Up- 
ating simply requires knowing the previous forecast a 2 and 
residual. In the GARCH(1,1) model, the weights are respec- 
tively (1 — ot\ — Px, «i, j3{) and the long run average vari- 
ance is ■ I* should be noted that this only works 
if ax + Pi < 1, and only really makes sense if the weights 
are positive requiring ax > 0, fix > and ao > 0. The de- 
scribed GARCH model is typically called the GARCH(1, 1) 
model. The first parameter in GARCH(q, p) models refers to 
the number of autoregressive lags or ARCH terms appearing 
in the equation, while the second parameter refers to number 
of moving average lags, which here is often called the num- 
ber of GARCH terms. Sometimes models with more than one 
lag are needed to find good variance forecasts. For estimating 
parameters of an equation like the GARCH(1, 1) when the 
only variable on which there are data is r t we can use maxi- 
mum likelihood approach by substituting conditional variance 
for unconditional variance in the normal likelihood and then 
maximize it with respect to the parameters. The likelihood 
function provides a systematic way to adjust the parameters 
ao, ax and f3x to give the best fit. 



ESTIMATION OF THE GARCH PARAMETERS BY 
CALCULATING MARKOV LENGTH SCALE 

We begin by describing the procedure that lead to find opti- 
mal GARCH parameters (p, q) based on the (stochastic) data 
set. As the first step we check whether the return (volatil- 
ity) of data follows a Markov chain and, if so, measure the 
Markov length scale t^j and find p (also q) equal to tu- As 
is well-known, a given process with a degree of randomness 
or stochasticity may have a finite or an infinite Markov length 
scale. The Markov length is the minimum time interval over 
which the data can be considered as a Markov process. To 
determine the Markov length tM, we note that a complete 
characterization of the statistical properties of stochastic fluc- 
tuations of a quantity r in terms of a parameter t requires the 
evaluation of the joint probability distribution function (PDF) 
Pn(ri,tx; ■ ■ ■ ; r n , t n ) for an arbitrary n, the number of the 
data points. If the phenomenon is a Markov process, an im- 
portant simplification can be made, as the n-point joint PDF, 
P n , is generated by the product of the conditional probabilities 
p(ri + x,U + x\ri, ti), for i = 1, . . . ,n — 1. A necessary condi- 
tion for a stochastic phenomenon to be a Markov process is 
that the Chapman- Kolmogorov (CK) equation 12011 . 

P{riMn,tx) = J dr 3 P(r 2 ,t 2 \r 3 ,t')P(r 3 ,t'\rx,tx). (2) 

should hold for any value of t' in the interval t 2 < t 1 < tx- 
One should check the validity of the CK equation for differ- 
ent r*i by comparing the directly-evaluated conditional prob- 
ability distributions p(r2,t2\rx,tx) with the ones calculated 
according to right side of equation (1). The simplest way to 
determine tj\t for stationary or homogeneous data is numeri- 
cal calculation of following quantity, 

S = \p(r2,t 2 \rx,tx) - J dr 3 p(r 2 ,t 2 \r 3 ,t')p(r 3 ,t'\rx,tx)\,(3) 

for given r*i and r 2 , in terms of a time interval, for example, 
t' — tx- After that, tj^ = f — tx if the S value vanishes 
(regarding the possible errors in estimating S). To find q we 
repeat the above procedure for volatility series. 

A simple formula like as a„ = a n -x + a n -2 refers that to 
obtain the present value we need to consider two successive 
past values. In fact, the Markov length of such processes is 
equal to 2. In this way, one can interpret q and p in GARCH 
(q,p) model as Markov time scales for returns and volatilities 
time series, respectively. Therefore, we suggest a new frame- 
work to find the q and p values in GARCH models through 
calculating Markov time scale for both return and volatility 
series. Based on new approach the Markov lengths in return 
and volatility series show us how many steps in these series we 
need to go back to get a good description of the process. Ad- 
ditionally, we can interpret Markov length as a kind of mem- 
ory of the data. Knowing this memory helps us to understand 
how present values of return and volatility are affected by past 
valuse. 
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FIG. 1: Comparison of the empirical probability distribution of 
TEPIX returns with the fitted normal distribution and GARCH(1,1) 
model. 

Using this approach, we have considered some time series 
related to financial markets and commodities. We have calcu- 
lated their Markov time scales for both returns and volatilities. 
Results are presented in Table 1 . Taking into account results, 
indicates that Markov time scale of returns and volatilities for 
most of such data is equal 1. This implies that GARCH (1,1) 
model can be a well established model for financial modeling 
and descriptions. The calculated likelihood values for vari- 
ous GARCH {q,p) models support our approach. There is a 
good agreement between likelihood results and what Markov 
time scale approach suggests. Normally, the GARCH (1,1) 
model works better than other GARCH models and now it 
seems that the standard model has obtained another confir- 
mation and could well capture the most important features of 
these kind of data in general. An interesting feature of our 
approach is that it works for both normal and fat-tail distri- 
butions. We have presented skewness, kurtosis and Hurst ex- 
ponent values of selected time series in Table 1 to show sta- 
tistical properties and fat-tailness of them 112 ill . They show 
different degrees of fat-tailness. Based on presented values in 
Table 1 the selected time series are far from normal distribu- 
tion and have fat-tail distributions features and are very close 
to normal distributions. For normal distributions Hurst expo- 
nent is equal to 0.5 and greater exponents indicate fat-tailness. 
S&P500 and dow-jones indices together brent oil prices have 
lower degrees of fat-tailness compared to TEPIX and NZX. 
For a better comparison of the return distribution of TEPIX as 
an example of fat-tail distributions with a normal distribution 
and GARCH(1,1) model, the empirical PDF is presented in 
Fig.l along with the fitted normal and GARCH(1,1) PDFs. 

We have used above formalism for some daily time se- 
ries such as S&P500 (20 October 1982 to 24 November 
2004), Dow-Jones( 4 January 1915 to 20 February 1990), 
New Zealand Exchange (NZX-7 January 1980 to 30 Decem- 




FIG. 2: The 5* values of daily prices of Brent Oil along with their sta- 
tistical errors for return (upper graph) and volatilities (lower graph) 
in period 20 October 1985 to 24 September 2006. 



ber 1999), Brent oil prices (4 January 1982 to 24 Septem- 
ber 2006) and Tehran Price Index (TEPIX-20 May 1994 to 18 
March 2004). 

The value of S in Eq. (3) has been calculated for two time 
series: returns and volatilities. In fig. 1 the results of S values 
related to daily prices of Brent Oil along with their statisti- 
cal errors for different time scales are shown. The interesting 
point is that our calculations show that Markov time scale for 
the daily return and volatility series in Brent oil prices is q = 1 
and p = 2, respectively. 

In summary, we introduced a new criterion to find optimal 
q and p in GARCH family models. We have shown that a 
fundamental time scale in our approach is the Markov length, 
tM, which is the minimum time interval over which the data 
can be considered as constituting a Markov process. This cri- 
terion support the success of standard GARCH (1,1) model to 
describe the majority of time series. In fact we suggest cal- 
culating Markov length for both return and volatility series to 
find proper parameters of GARCH models before proceeding 
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TABLE I: The estimated values GARCH process parameters using 
Markov length scale approach 



Time series 


Skewness 


Kurtosis 


Hurst exponent 


q 


P 


S&P 500 


1.93 


45.10 


0.44 


l 




Dow-Jones 


-0.25 


17.43 


0.53 


l 




NZX 


-1.43 


27.29 


0.61 


l 




TEPIX 


0.72 


18.50 


0.74 


l 




Brent Oil 


-1.25 


34.99 


0.51 


2 





to describe such processes and now we have a new method to 
select optimal q and p parameters in a GARCH (q, p) models. 
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